Search resource list
java-cluster.zip
- Text clustering implemented in Java, including the data preprocessing done before clustering: word segmentation, dimensionality reduction, building the vector space model, and so on.
textcluster
- Source code for a text clustering algorithm, including an implementation of the tf.idf computation, written in Java.
lingpipe-3.6.0
- An open-source Java toolkit for natural language processing. LingPipe already offers a rich set of features, including topic classification, named entity recognition, part-of-speech tagging, sentence detection, query spell checking, interesting phrase detection, clustering, character language modeling (Character
dlucene.tar
- A distributed full-text search toolkit with cluster support, developed mainly in Java and fairly easy to use.
text_data_mining
- Data mining code written in Java, including Java source for text classification and authorship identification. I personally took part in writing it.
An-Introduction-to-Parallel-and-Vector-Scientific
- In this text, students of applied mathematics, science and engineering are introduced to fundamental ways of thinking about the broad context of parallelism. The authors begin by giving the reader a deeper understanding of the issues through a genera
vsm-and-cluster-kmeans
- Explains the vector space model (VSM) mechanism for text clustering, as well as K-means and related methods, in considerable detail. A good book for studying text clustering.
base-on-WEKA-text-cluster-apllying
- A text classification test on the Weka platform; the source code is in Java.